Discovery of Unknown Events From Multi-lingual News

نویسندگان

  • Kin Hui
  • Wai Lam
  • Helen Meng
چکیده

We have proposed a new approach to detect topically-related events from multi-lingual news sources. In particular, we are interested in Chinese and English on-line newswire stories. Three categories of named entities terms, namely, people names, geographical location names, and organization names, together with the story content terms constitute the basis for story representation. The named entities are extracted by transformation-based linguistic taggers. One for Chinese and one for English. For Chinese stories, we tackle the unknown word problems by means of a hybrid solution of rule-based and statistical-based methods. To conduct event detection in multi-lingual settings, we conduct gross translation on Chinese story representation into English. One gross translation approach is the basis translation method using only a bilingual dictionary. The second approach makes use of a parallel corpus as an additional resource. Unsupervised learning technique is employed to discover new events.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Comparative Review of Hijab Discovery News Coverage in News Media

Purpose: News media play an important role in attitude towards various issues including hijab and hijab discovery. As a result, the purpose of this research was comparative review of hijab discovery news coverage in news media. Methodology: This study in terms of purpose was applied and in terms of implementation method was quantitative. The research population was the hijab discovery news in ...

متن کامل

Cross-Lingual Retrieval of Identical News Events by Near-Duplicate Video Segment Detection

Recently, for reusing large quantities of accumulated news video, technology for news topic searching and tracking has become necessary. Moreover, since we need to understand a certain topic from various viewpoints, we focus on identical event detection in various news programs from different countries. Currently, text information is generally used to retrieve news video. However, cross-lingual...

متن کامل

Cross-Lingual Trends Detection for Named Entities in News Texts with Dynamic Neural Embedding Models

This paper presents an approach to detect real-world events as manifested in news texts. We use vector space models, particularly neural embeddings (prediction-based distributional models). The models are trained on a large ‘reference’ corpus and then successively updated with new textual data from daily news. For given words or multi-word entities, calculating difference between their vector r...

متن کامل

Towards cross-lingual alerting for bursty epidemic events

BACKGROUND Online news reports are increasingly becoming a source for event-based early warning systems that detect natural disasters. Harnessing the massive volume of information available from multilingual newswire presents as many challanges as opportunities due to the patterns of reporting complex spatio-temporal events. RESULTS In this article we study the problem of utilising correlated...

متن کامل

Multi-filtering Method Based Cross-lingual Link Discovery

This paper describes cross-lingual link discovery method of ISTIC used in the system evaluation task at NTCIR-9. In this year's evaluation, we participated in cross-lingual link discovery task from English to Chinese. In this paper, we mainly describe our understanding for CLLD, the key techniques of our system, and the evaluation results.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001